A Novel Two-Phase SOM Clustering Approach to Discover Visitor Interests in a Website

نویسندگان

  • Ahmad Ammari
  • Valentina V. Zharkova
چکیده

Mining content, structure and usage data in websites can uncover browsing patterns that different groups of Web visitors follow to access the subjects that are truly valuable to them. Many works in the literature focused on proposing new similarity measures to cluster Web logs and detect segments of browsing behaviors. However, this does not reveal which contents the visitors are interested in since a Web page may contain many different topics. In this paper, a novel two-phase clustering approach based on Self Organizing Maps (SOM) is proposed to address this problem. A systematic process to prepare Web content data for clustering is also described.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map

Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...

متن کامل

سیستم پیشنهادگر هوشمند برای خرده‌فروشی اینترنتی با استفاده از نقشه خودسازمانده و قواعد انجمنی بر اساس الگوهای جمعیت‌شناختی مشتریان

  The intensive competition in e-Commerce causes effective methods for customer attraction of special importance. In this regard, the recommender systems in commercial websites can precisely determine customers' interests and needs, and offer them most suitable products and services. In this paper, a new model for recommender systems is proposed which segments the market and customers more effi...

متن کامل

A Novel Fault Detection and Classification Approach in Transmission Lines Based on Statistical Patterns

Symmetrical nature of mean of electrical signals during normal operating conditions is used in the fault detection task for dependable, robust, and simple fault detector implementation is presented in this work. Every fourth cycle of the instantaneous current signal, the mean is computed and carried into the next cycle to discover nonlinearities in the signal. A fault detection task is complete...

متن کامل

An Efficient Machine Learning Regression Model for Rainfall Prediction

Interfacing through the continuously rising amounts of data in technical, medical, scientific, engineering, industrial and monetary fields and their renovation to logical form for the human user is one of the main requirements. To quickly discover and analyze complex patterns and requirements, we need the efficient techniques and need to learn from new data will be necessary for information-int...

متن کامل

Hierarchical Representatives Clustering with Hybrid Approach

Clustering is a discovering process of meaningful intbrmation by grouping similar data into compact clusters. Most of traditional clustering methods are in favor of small datasets and have difficulties handling very large datasets. They are not adequate clustering methods for partitioning huge datasets in data mining perspective. We propose a new clustering technique, HRC(hierarchical represent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010